    The Genomes of Oryza sativa: A History of Duplications

    We report improved whole-genome shotgun sequences for the genomes of indica and japonica rice, both with multimegabase contiguity, an almost 1,000-fold improvement over the drafts of 2002. Tested against a nonredundant collection of 19,079 full-length cDNAs, 97.7% of the genes are aligned, without fragmentation, to the mapped super-scaffolds of one or the other genome. We introduce a gene identification procedure for plants that does not rely on similarity to known genes to remove erroneous predictions resulting from transposable elements. Using the available EST data to adjust for residual errors in the predictions, the estimated gene count is at least 38,000–40,000. Only 2%–3% of the genes are unique to any one subspecies, comparable to the amount of sequence that might still be missing. Despite this lack of variation in gene content, there is enormous variation in the intergenic regions. At least a quarter of the two sequences could not be aligned, and where they could be aligned, single nucleotide polymorphism (SNP) rates varied from as little as 3.0 SNP/kb in the coding regions to 27.6 SNP/kb in the transposable elements. A more inclusive new approach for analyzing duplication history is introduced here. It reveals an ancient whole-genome duplication, a recent segmental duplication on Chromosomes 11 and 12, and massive ongoing individual gene duplications. We find 18 distinct pairs of duplicated segments that cover 65.7% of the genome; 17 of these pairs date back to a common time before the divergence of the grasses. More important, ongoing individual gene duplications provide a never-ending source of raw material for gene genesis and are major contributors to the differences between members of the grass family.

    A Prognosis Classifier for Breast Cancer Based on Conserved Gene Regulation between Mammary Gland Development and Tumorigenesis: A Multiscale Statistical Model

    Funding: National Basic Research Program of China [2010CB945004]; National Natural Science Foundation of China [30772546].
    Identification of novel cancer genes for molecular therapy and diagnosis is a current focus of breast cancer research. Although a few small gene sets have been identified as prognosis classifiers, more powerful models are still needed to define effective gene sets for diagnosis and treatment guidance in breast cancer. In the present study, we developed a novel statistical approach for systematic analysis of intrinsic correlations of gene expression between development and tumorigenesis in the mammary gland. Based on this analysis, we constructed a predictive model for prognosis in breast cancer that may be useful for therapy decisions. We first defined developmentally associated genes from a mouse mammary gland epithelial gene expression database. We then found that cancer-modulated genes were enriched in this list of developmentally associated genes. Furthermore, the developmentally associated genes had a specific expression profile that was associated with the molecular characteristics and histological grade of the tumor. These results suggested that the processes of mammary gland development and tumorigenesis share gene regulatory mechanisms. The genes regulated in both the developmental and tumorigenesis processes were then defined as an 835-member prognosis classifier, which predicted the clinical outcome of three groups of breast cancer patients (predictive accuracy 64–72%) with robust prognosis prediction (hazard ratio 3.3–3.8, higher than that of other clinical risk factors, around 2.0–2.8). In conclusion, our results identified conserved molecular mechanisms between mammary gland development and neoplasia, and provided a potential model for mining unknown cancer genes and predicting the clinical status of breast tumors. These findings also suggested that the developmental roles of genes may be important criteria for selecting genes for prognosis prediction in breast cancer.

    The 835 prognosis classifier acts as a powerful predictor of clinical outcome in 78 breast cancer patients.

    A. The 78 breast cancer samples were first classified by their expression profiles of the 835-member prognosis classifier, using unsupervised classification. The survival curves of the two resulting groups were then compared by Kaplan-Meier analysis to define clinical outcome (lower panel). Classification accuracy was assessed with Fisher's exact test (upper table). B. Distribution of tumor risk factors in the four groups defined by clinical metastasis and by the 835-member prognosis classifier. C. Prognostic value of the 835-member prognosis classifier and of tumor risk factors.
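
The accuracy check in panel A can be illustrated with a small, self-contained version of Fisher's exact test on a 2x2 table (classifier group versus observed outcome). This is a minimal sketch: the function name and the counts are hypothetical, and the legend does not say whether the authors used a one-sided or two-sided test, so the two-sided form here is an assumption.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]].

    Rows could be the two classifier-defined groups and columns the observed
    outcome (for example metastasis / no metastasis). That layout is an
    assumption made for illustration.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2
    denom = comb(total, col1)

    def prob(k: int) -> float:
        # Hypergeometric probability of seeing k in the top-left cell
        # when all row and column totals are held fixed.
        return comb(row1, k) * comb(row2, col1 - k) / denom

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    # Sum the probabilities of all tables at least as extreme as the observed one.
    # The small tolerance guards against floating-point ties.
    return sum(
        p
        for p in (prob(k) for k in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)
    )


# Hypothetical counts, not taken from the study.
p_value = fisher_exact_two_sided(3, 1, 1, 3)
print(f"two-sided p = {p_value:.4f}")
```

A small p-value would indicate that group membership and outcome are unlikely to be independent, which is the sense in which the test supports classification accuracy.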

    The 835 prognosis classifier could predict clinical outcome in a large set of breast cancer patients.

    The intrinsic dataset was applied to 144 node-positive and 151 node-negative primary breast tumors. The prediction accuracy (A) and the prognostic value (B) of the 835-member prognosis classifier and of tumor risk factors were assessed by the same approach as described in the legend of Fig. 5B (http://www.plosone.org/article/info:doi/10.1371/journal.pone.0060131#pone-0060131-g005).

    Identification of genes associated with the developmental phases of growth, lactation, and involution among the mammary gland developmentally associated gene subset.

    A. The developmentally associated genes were clustered into three groups by principal component analysis. The expression profile of each gene across the mammary pregnancy cycle is represented as a dot in the plane of PC1 (first principal component axis) and PC2 (second principal component axis). All probe sets were assigned to one of three groups: growth (PC1 > 0), involution (PC1 < 0 and PC2 > 0), and lactation (PC1 < 0 and PC2 < 0), based on the number of genes with peak expression at a particular developmental time (shown in B). B. The time of peak expression for each developmentally associated gene was plotted as a histogram and classified by developmental phase (growth, yellow; lactation, blue; involution, purple). Each column represents the number of genes with peak expression at a particular developmental time. C. The frequency of literature-based cancer-modulated genes in the gene subsets associated with the three stages of mammary gland development. The “growth” group contained more literature-based cancer-modulated genes (20%) than the “lactation” (14.7%) and “involution” (17%) groups (p < 0.05).
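
The grouping rule stated in panel A amounts to a sign test on the two principal component scores. The sketch below encodes only that rule; the function name and the example coordinates are hypothetical, and the legend does not say how points lying exactly on an axis were handled, so they are returned as "unassigned" here.

```python
def assign_phase(pc1: float, pc2: float) -> str:
    """Map a probe set's (PC1, PC2) scores to a developmental phase.

    Rule taken from the legend: growth (PC1 > 0), involution (PC1 < 0 and
    PC2 > 0), lactation (PC1 < 0 and PC2 < 0).
    """
    if pc1 > 0:
        return "growth"
    if pc1 < 0 and pc2 > 0:
        return "involution"
    if pc1 < 0 and pc2 < 0:
        return "lactation"
    # Boundary cases (a score of exactly zero) are not covered by the legend.
    return "unassigned"


# Hypothetical scores, for illustration only.
example_scores = {
    "probe_a": (1.2, -0.4),
    "probe_b": (-0.8, 0.5),
    "probe_c": (-0.3, -0.9),
}
phases = {probe: assign_phase(pc1, pc2) for probe, (pc1, pc2) in example_scores.items()}
print(phases)
```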

    Defining the 835 prognosis classifier from the developmentally associated genes based on their expression in tumors.

    For each developmentally associated gene, we first counted the number of breast cancer datasets in which its expression was “altered”. Based on this count, all developmental genes were grouped into six subsets (Sub0, Sub1, Sub2, Sub3, Sub4, and Sub5). The percentage of literature-based cancer-modulated genes in each subset is shown in the table (A) and the histogram (B). Results for non-developmental genes, obtained with the same method, are shown as a control. Details are described in the text.
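
The counting and grouping step described in this legend can be sketched as follows. This is an illustration under stated assumptions, not the authors' code: the function names, gene names, and dataset names are hypothetical, and folding counts above five into Sub5 is an assumption, since the legend names six subsets but does not state how many datasets were used.

```python
from collections import defaultdict


def group_by_alteration_count(altered_in: dict, max_count: int = 5) -> dict:
    """Group genes by the number of datasets in which they are altered.

    altered_in maps a gene name to the set of dataset names where its
    expression is altered. Counts above max_count are folded into the
    last subset (an assumption, see the note above).
    """
    subsets = defaultdict(list)
    for gene, datasets in altered_in.items():
        count = min(len(datasets), max_count)
        subsets[f"Sub{count}"].append(gene)
    return dict(subsets)


def percent_cancer_modulated(subsets: dict, cancer_genes: set) -> dict:
    """Percentage of each subset that appears in a literature-based gene list."""
    return {
        name: 100.0 * sum(gene in cancer_genes for gene in genes) / len(genes)
        for name, genes in subsets.items()
    }


# Hypothetical input, for illustration only.
altered_in = {
    "gene1": set(),
    "gene2": {"dataset_a"},
    "gene3": {"dataset_a", "dataset_b"},
    "gene4": {"dataset_c"},
}
subsets = group_by_alteration_count(altered_in)
percentages = percent_cancer_modulated(subsets, cancer_genes={"gene2"})
print(subsets)
print(percentages)
```

Running the same two functions on non-developmental genes would give the control series mentioned in the legend.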

    Definition of mammary gland developmentally associated genes.

    A. Overview of the data processing used to define the developmentally associated gene subset. The probes were filtered systematically with different cutoffs: the p value of gene expression among different time points, and the optimal fold ratio of maximum to minimum expression of a gene across developmental time points, chosen to maximize the odds ratio of literature-based mammary gland-cycle associated genes between the developmentally associated and non-developmentally associated gene subsets. A higher odds ratio means that a greater number of developmental genes were correctly classified. B. Curve of odds ratios for the developmentally associated and non-developmentally associated gene subsets defined by cutoffs with different ratios of maximum to minimum expression of each gene across time points in developmental progression. C. Fisher's exact test assessing the frequency of validated cancer genes in the group of mammary gland developmentally associated genes. The validated cancer genes were obtained from previously published papers (File S5, http://www.plosone.org/article/info:doi/10.1371/journal.pone.0060131#pone.0060131.s005).
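
The cutoff selection described here (pick the max/min fold ratio that maximizes the odds ratio of literature-listed genes) can be sketched as below. This covers only the fold-ratio cutoff, not the p value filter. All names and numbers are hypothetical, and skipping cutoffs that leave an empty cell in the 2x2 table is a design choice made for this illustration, since the odds ratio is then undefined or infinite.

```python
def odds_ratio(a: int, b: int, c: int, d: int) -> float:
    """Odds ratio for the 2x2 table below (requires b > 0 and c > 0).

                            literature-listed   not listed
        developmental               a                b
        non-developmental           c                d
    """
    return (a * d) / (b * c)


def best_cutoff(fold_ratio: dict, literature_genes: set, cutoffs: list):
    """Return (cutoff, odds_ratio) for the cutoff with the highest odds ratio.

    fold_ratio maps a gene to its max/min expression ratio over time points.
    Genes at or above the cutoff are called developmentally associated.
    Returns None if every cutoff produces an empty cell.
    """
    best = None
    for cutoff in cutoffs:
        developmental = {g for g, ratio in fold_ratio.items() if ratio >= cutoff}
        other = set(fold_ratio) - developmental
        a = len(developmental & literature_genes)
        b = len(developmental) - a
        c = len(other & literature_genes)
        d = len(other) - c
        if min(a, b, c, d) == 0:
            continue  # degenerate table, skipped in this sketch
        score = odds_ratio(a, b, c, d)
        if best is None or score > best[1]:
            best = (cutoff, score)
    return best


# Hypothetical fold ratios and literature list, for illustration only.
fold_ratio = {
    "g1": 5.0, "g2": 4.0, "g3": 3.0, "g4": 2.5,
    "g5": 2.0, "g6": 1.5, "g7": 1.2, "g8": 1.1,
}
literature_genes = {"g1", "g2", "g6"}
result = best_cutoff(fold_ratio, literature_genes, cutoffs=[2.0, 3.0, 4.0])
print(result)
```

Plotting the odds ratio for each candidate cutoff, rather than keeping only the maximum, would give a curve of the kind shown in panel B.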

    The enrichment of ontology in 835 intrinsic genes.

    EASE score < 0.05. # Genes shown in red are cancer mutant genes identified in the reference (Nat Rev Cancer, 4(3):177).